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MANATEE VOCALIZATION DETECTION METHOD AND SYSTEM 

Origin of the Invention 
5 The invention described herein was made in the 

performance of official duties by an employee of the 
Department of the Navy and may be manufactured, used, 
licensed by or for the Government for any governmental 
purpose without payment of any royalties thereon. 

10 

Field of the Invention 

The invention relates generally to detection of 
underwater sounds, and nore particularly to a method and 
system for detecting manatee vocalizations in the water. 

15 

Background of the Invention 

The West Indian manatee is a large, plant-eating 
aquatic mammal that congregates for much of the year in 
shallow coastal areas and warm water canals. Unfortunately, 
20 these are also the locations traversed by many recreational 

and commercial vessels. As a result, the manatee is being 
injured and killed by collisions with vessel hulls and 
propellers . 

In an effort to reduce such collisions, a variety of 
25 solutions have been proposed and/or tried. The solutions 

include regulating vessel speeds in areas frequented by 
manatees, performing aerial surveillance during specified 
times (e.g., boat races, times of high boat traffic, during 
times of high performance boat/motor testing, etc.,), 
30 instituting a moratoria on permits for marinas and other 

facilitates in areas frequented by manatees, and various 
forms of manatee detection schemes such as use of volunteer 
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"spotters", signals between boaters and acoustic detection. 
Obviously, a robust mechanized method of manatee detection is 
the preferred approach. However, the high noise environment 
of shallow coastal waters has, thus far, led to acoustic 
detection schemes having high false alarm rates. Detection 
schemes with high false alarm rates lead to unnecessary boat 
maneuvering and, over time, a loss of user confidence in the 
detection scheme. 

Summary of the Invention 

Accordingly, it is an object of the present invention 
to provide a method and system for detecting the presence of 
manatees in a water environment. 

Another object of the present invention is to provide a 
method and system for detecting the presence of manatees in 
high-noise shallow-water environment. 

Other objects and advantages of the present invention 
will become more obvious hereinafter in the specification and 
drawings . 

In accordance with the present invention, the detection 
of manatee vocalizations begins with the detection of sounds 
in an underwater region. After a digital representation of 
the sounds is generated, sounds at unwanted frequencies are 
filtered out resulting in the generation of filtered data in 
a frequency range of interest. A plurality of overlapping 
time-windowed portions of the filtered data are then formed. 

For each time-windowed portion, a power spectrum for 
frequency bins in the frequency range of interest is 
estimated. Each such power spectrum is then normalized using 
an average of power spectrums calculated over a plurality of 
the frequency bins. As a result, a normalized spectrum. value 
is defined for each frequency bin with each normalized 
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spectrum value associated with broadband noise having a value 
that is less than that associated with narrowband tones. 
Next, for each of the overlapping time-windowed portions, 
normalized spectrum values indicative of narrowband tones 
5 that occur (i) in windows of frequency bins encompassing 

frequencies lower and higher than each of a lowest resonant 
frequency (associated with manatee vocalizations) and its 
harmonics and (ii) with a specified degree of variance with 
respect to frequency separation therebetween, are filtered 

10 out. As a result of this filtering step, a resulting number 

of normalized spectrum values indicative of narrowband tones 
are maintained. Each normalized spectrum value that is not 
filtered out is assigned to a f requencybased harmonic set. 
A scoring routine is applied that uses the normalized 

15 spectrum value in each harmonic set. A manatee vocalization 

is indicated when a threshold score is achieved across at 
least a portion of the overlapping timewindowed portions. 

Brief Description of the Drawings 

2 0 Other objects, features and advantages of the present 

invention will become apparent upon reference to the 
following description of the preferred embodiments and to the 
drawings, wherein corresponding reference characters indicate 
corresponding parts throughout the several views of the 
25 drawings and wherein: 

FIG. 1 is a block diagram of a system used for the 
detection of manatee vocalizations in accordance with the 
present invention; and 

FIG. 2 is a top-level flow diagram of a method used to 

3 0 detect manatee vocalizations in accordance with an embodiment 

of the present invention. 
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Detailed Description of the Invention 

Referring now to the drawings, and more particularly to 
FIG. 1, a system for detecting manatee vocalizations in 
accordance with the present invention is shown and is 
5 referenced generally by numeral 10. In general, system 10 

includes one or more hydrophone ( s ) 12 placed in anunderwater 
region of interest, e.g., in the vicinity of a marina, in a 
shipping channel, on the hull or just below a surface vessel, 
on or towed behind a submerged vessel, etc. The analog 

10 output of hydrophone ( s ) 12 is supplied to a local or 

remotely-located signal processor 14 which performs a number 
of signal processing operations thereon. Signal processor 14 
supplies an output when manatees are detected to one or more 
alarm system (s) 16 to generate one or more of a visual, 

15 audible and tactile type of alarmsi The particular hardware 

used to construct system 10 can be selected from a wide 
variety of components as would be well understood by one of 
ordinary skill in the art. Accordingly, the particular 
hardware used to implement the present invention is not a 

20 limitation thereof. 

Signal processor 14 encompasses a number of processing 
operations which can be achieved by one or several processing 
components. Accordingly, it is to be understood that the 
separate blocks shown in the drawings are solely for purpoe 

25 of facilitating an explanation of the present invention. The 

analog output from hydrophone ( s ) 12 is first converted into a 
digital signal representation thereof at analog-to-digital 
(A/D) conversion block 141. The digital signals are then 
passed through a passband filtering block 142 so that only 

30 the broad frequency range associated with manatee 

vocalizations. Typically, this frequency range is 
approximately 1600 Hz to 
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18 KHz. 

The digitized, bandpass . filtered data is then split 
into overlapping time-windowed data blocks at time windowed 
data formation block 143. Each data block has a time 
duration less than or equal to that of a typical manatee 
vocalization or approximately 0.175 seconds or less. A power 
spectrum estimation block 144 processes each time-windowed 
data block to, in general, provide an estimate of the power 
spectrum thereof in the frequency domain. Each power 
spectrum value associated with each frequency bin is then 
normalized at block 145. The output sequence generated by 
normalization results in values having a mean of 1.0 when 
broadband noise is the input and a larger mean when the input 
is a narrowband tonal signal. A noise suppression filtering 
block 14 6 evaluates the narrowband tonal signals as a 
function of their frequency bins and operates to suppress 
those signals that are from closely-spaced frequency bins. 
Finally, the remaining narrowband tonal signals are scored at 
a rule-based detection scheme block 147. Each time the rule 
based scheme is satisfied over a number of overlapping time- 
windowed data blocks, one or more of alarm system (s) 16 are 
activated. 

Referring now additionally to FIG. 2, one embodiment of 
the method of the present invention will be explained in 
greater detail. An analog output 100 from hydrophone (s) 12 
is converted to digital data at step 102. The digital data 
is bandpass filtered at step 104 to filter out all 
frequencies other than those in the range of typical manatee 
vocalizations which typically ranges between 1600 Hz and 18 
KHz. The resulting digital data is then split into 
overlapping time-windowed data blocks at step 106. The time 
or length of each time window of data is selected to be less 
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than or equal to that of a typical manatee vocalization or 
less than or equal to 0.175 seconds. The amount of overlap 
is selected to assure the capture near the start of a manatee 
vocalization and is typically on the order of 75% of the 
5 window length. 

The time-windowed, bandpass filtered digital blocks of 
data are processed in four stages broadly illustrated by 
steps 108, 110, 112 and 114. Note that the second and third 
stages may be performed in an iterative fashion. 

10 The first stage (step 108) involves power spectrum 

estimation of each time-windowed portion of the data received 
from step 106. A variety of power spectrum estimation 
techniques can be used without departing from the scope of 
the present invention. By way of illustrative example, the 

15 well-known periodogram technique is applied in the 

illustrated embodimait . Briefly, as is known in the art, 
each time-windowed portion of the data is split into smaller 
overlapping time-windowed blocks at step 1080. Next, step 
1082 performs a Fast Fourier Transform (FFT) on each of the 

2 0 smaller windows of data to convert same into the frequency 

domain. Finally, at step 1084, an estimation of the power 
spectrum as a function of frequency bins is formed by taking 
the average of the squares of the FFTs . 

The second stage (step 110) of processing in the 

2 5 present invention involves normalizing each power spectrumin 

an effort to suppress broadband noise while maintaining 
narrowband tones indicative of typical manatee vocalizations. 

This second stage also suppresses the effects of changes in 
broadband ambient noise conditions while maintaining the 

30 narrowband structure of manatee vocalizations. 

Once again, a variety of normalization schemes can be 
used without departing from the present invention. By way of 
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1.0 whereas the presence of a narrowband tone in a frequency 
bin results in a normalized value that is (generally 
significantly) greater than 1.0. 

The third stage (step 112) of processing involves the 
application of a noise suppression technique that evaluates 
the spacing between frequency bins containing narrowband 
tones. The goal of noise suppression is to maintain 
narrowband tones indicative of manatee vocalizations while 
suppressing narrowband tones indicative of noise and 
broadband noise not removed by the normalization process. 
Typical manatee vocalizations have a lowest resonant 
frequency (at which their amplitude is high) and successive 
resonant frequencies (at which amplitude eventually falls 
off). Accordingly, when narrowband tones are indicated by 
the results of step 110, the frequency at which the 
narrowband tones occur is evaluated in terms of resonant 
frequencies of interest in an effort to reduce false alarms. 

More specifically, step 1120 provides (i.e., user 
selected/provided) one or more lowest resonant frequencies of 
interest depending on the background noise characteristics, 
type of manatee, age of manatee, sex of manatee, etc., that 
is of particular interest. Next, at step 1122, closely- 
spaced narrowband tones occurring in windows of frequency 
bins that are above and below each of the lowest resonant 
frequency and its harmonics are filtered/suppressed. As used 
herein, the phrase "closely- spaced narrowband tones" refers 
to a high degree of variance in the occurrence of narrowband 
tones within the above- noted windows of frequency bins. For 
example, if a lowest resonant frequency of 1600 Hz was 
provided at step 1120 (with harmonics at 3200 Hz, 4800 Hz, 
etc.), the closeness or variance with respect to the 
occurrence of narrovfcand tones occurring in windows of 
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frequency bins that encompass frequencies that are lower and 
higher than (but not including) each of 1600 Hz, 3200 Hz, 
4800 Hz, etc., is evaluated. A high degree of variance 
within these windows results in suppression of such 
5 narrowband tones. The size of the windows (of frequency- 

bins) and the level of acceptable or unacceptable variance 
are predetermined by the user. In step 1122, filtering or 
suppression can be accomplished by resetting the normalized 
power spectrum value at that frequency bin to a value of 1.0. 

10 The fourth and final stage (step 114) of processing 

involves scoring the narrowband tone data. Prior to applying 
the scoring routine, step 1140 assigns each remaining 
frequency bin having a normalized power spectrum value 
indicative of a narrowband tone to a harmonic set based upon 

15 the frequency of the frequency bin. Each harmonic set is 

defined by a base frequency and resonance thereof. Next, 
step 1142 orders the normalized values for each harmonic set 
based on the magnitude thereof, i.e., largest to smallest. 
Next, at step 1144, two requirements are checked for each 

20 harmonic set's base frequency. By way of example, the first 

requirement is that either (i) two or more significant 
harmonics of the harmonic set 1 s base frequency exist (e.g., 
the values at the harmonics are indicative of the spectrum 
level for the narrowband tone) or (ii) the value of the 

25 largest harmonic minus one is less than or equal to a 

percentage (e.g., 5%) of the second largest harmonic 
component minus one. The second requirement is that the 
resonant frequencies associated with the largest and second 
largest ordered values (i.e., output from step 1142) 

3 0 correspond to two of the four lowest resonant frequencies of 

a harmonic set . 

If either of these requirements is not satisfied, the 
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harmonic set is assigned a score of zero at step 1146. If 
the requirements are satisfied for a harmonic set, then the 
harmonic set has a score computed at step 1148. Such scoring 
could generate an average vaLue that is computed over the 
5 harmonic set's resonances. Note that such averaging could 

limit participation in the average to those frequencies 
having (normalized power spectrum) values that are within 
some percentage of the highest value for the harmonic set. 
With each harmonic set having a score (i.e., either assigned 

10 at step 1146 or computed at step 1148) , step 1150 selects one 

harmonic set based on its score as an output. For example, 
the harmonic set achieving the highest score could be 
selected. The above-described scoring stage (step 114) is 
performed for each of the overlapping time-windowed portions 

15 of data. If, at step 116, a threshold score is achieved 

across one or more time-windowed portions of the data thereby 
indicating the presence of a manatee, an alarm signal is 
issued at step 118. As described above, the alarm can be 
visual, audible and/or tactile in nature. 

20 The advantages of the present invention are numerous. 

Manatee vocalizations are autonomously detected in a way that 
significantly reduces false alarms. Accordingly, "manatee 
alarms" generated by the present invention will have a high 
degree of confidence associated therewith. 

25 Although the invention has been described relative to a 

specific embodiment thereof, there are numerous variations 
and modifications that will be readily apparent to those 
skilled in the art in light of the above teachings. It is 
therefore to be understood that, within the scope of the 

3 0 appended claims, the invention may be practiced other than as 

specifically described. 

What is claimed as new and desired to be secured by 
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